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Abstract 

For t € [0,1] let H 2 [ntJ = {jni+j)YjLo denote the Hankel matrix of order 2[nt\ of a random 
vector (mi,... ,m 2 n) on the moment space At 2 n (I) of all moments (up to the order 2 n) of 
probability measures on the interval JcR. In this paper we study the asymptotic properties 
of the stochastic process {logdetf7 2 |ntj}te[o,i] as n —>• 00 . In particular weak convergence 
and corresponding large deviation principles are derived after appropriate standardization. 
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1 Introduction 

Hankel matrices are well studied objects in mathematics with applications in various fields such as 
orthogonal polynomials, random matrices or operator theory. Asymptotic properties of functions 
of non-random Hankel matrices such as the determinant, condition number or smallest eigenvalue 
have been studied by Hirschman Jr. (1966), Zamarashkin and Tyrtyshnikov (2001), Basor et al. 
(2001) or Berg and Szwarc (2011) among others. Recently, random Hankel matrices have also 
been considered in the literature with the main focus on matrices with independent entries. For 
example, Bryc et al. (2006) studied the limiting spectral measure of large Hankel (and Toeplitz) 
matrices, while some results regarding the operator norm can be found in Bose and Sen (2007). 
The present paper takes a different look at random Hankel matrices (more precisely, at their 
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log-determinants) with not necessarily independent entries. Our investigations are motivated by 
the fact that Hankel matrices are usually used to characterize the solution of classical moment 
problems. To be precise, let 7 C M denote an interval and define V(I) as the set of all probability 
measures on the Borel field of I with existing moments. For a measure /i G V (/) we denote by 


m k = m k (n) 



k = 0,1,2,... 


the /c-th moment and define 


(1.1) M(I) = {m(n) = (mi(/i),m 2 (/i), ...)*! // G V(I)} C 

as the set of all moment sequences. We denote by II n (n G N) the canonical projection onto the 
first n coordinates and call Ai n (I) = fl n (M(I)) C M n the n-th moment space. The Hamburger 
moment problem is to decide if a given sequence (m n ) ne pj is an element of A4(M) and it is well 
known that this is the case if and only if the Hankel matrices H_ 2k = (' m i+j)ij=o are nonnegative 
dehnite for all fceN [see Shohat and Tamarkin (1943)]. Moreover the vector = (mi ,..., m 2n ) 
is an element of the moment space A4 2n (M) if and only if the Hankel matrix H_ 2n = (mj +J j" J=0 
is nonnegative dehnite. Similar characterization can be obtained for the Sticltjes and Hausdorff 
moment problem corresponding to measures on the half line Mq = [0, oo) and the interval [0,1], 
respectively. 

Chang et al. (1993) considered the “classical” moment space corresponding to measures on the 
interval [0,1] [see Karlin and Shapeley (1953), Krein and Nudelman (1977), for some early refer¬ 
ences] and equipped A4 rt ([0,1]) with a uniform distribution. They proved asymptotic normality 
of an appropriately standardized version of a projection Hfc(m„) of a uniformly distributed vec¬ 
tor m n on _M n ([0,1]) as n —* oo. Gamboa and Lozada-Chang (2004) investigated corresponding 
large deviation principles, while Lozada-Chang (2005) studied similar problems for moment spaces 
corresponding to more general functions defined on a bounded set. More recently, some of these 
results have been generalized by Dette and Nagel (2012) to the moment spaces _M n ([0, oo)) and 
A4 n (M) corresponding to unbounded intervals. 

The present paper is devoted to the asymptotic analysis of Hankel determinants of random mo¬ 
ment vectors on Ai 2 n (I). For example, if riv 2n = (mi,..., m 2n ) denotes a random vector uniformly 
distributed on the 277-th moment space A4 2n ([0,1]), then it is shown in this paper that an appro¬ 
priately transformed and standardized version of the determinant of the random Hankel matrix 
H_ 2n = (rn i+ j)™ J=Q converges weakly, that is 

(1.2) -y= { log det H_ 2n — log det H^ n + } -^>^(0,1), 

\jn z 

where H_ 2n = (m° + ■)" - =0 denotes the Hankel determinant of the moments of the arcsine distribution 
on the interval [0,1], that is m^ = (^f)2~ 2 h Moreover, the sequence 

(1.3) - i { log det H 2n - log det H^ 2n } 
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satisfies a large deviation principle with a good rate function. It will be demonstrated in Section 
2 that the moments of the arcsine distribution used for the centering in (1.2) and (1.3) correspond 
to the center of the moment space A4([0,1]). 

Similar results are available for the moment spaces A4 2n ([0, oo)) and A42n(®0, where the centering 
has to be performed by the logarithms of the determinants of the Hankel matrices correspond¬ 
ing to the Marcenko-Pastur law and Wigner’s semi-circle law, respectively (in these cases the 
corresponding Hankel determinants H_ 2n have value 1). These measures play a very important 
role in the theory of random matrices, free probability and quantum probability, see the books of 
Hiai and Petz (2000) and Hora and Obata (2007) among others. 

The remaining part of this paper is organized as follows. In Section 2 we present some facts on 
moment theory and introduce random moment sequences on the spaces _M„([0,1]), A4 n ([0, oo)) 
and A4 2ri -i(®0- We also state some basic properties of these random variables which will be useful 
in the following discussion. In Section 3 it is shown that for the canonical distributions on the 
moment space A4 2n U) an appropriately standardized version of the stochastic process 

(1-4) {T> 2 |ntj}te[o,i] = {logdet// 2 |_ ni j}t e [ 0! i] 

converges weakly to a Gaussian process. The centering and scaling is different for the three 
moment spaces under consideration. We also study the asymptotic properties of the vector 
(D n> 2 ,..., D n} 2 kY f° r any fixed k. Large deviation principles are investigated in Section 4, while 
some technical results which are required for the proofs are provided in the Appendix. 


2 Some basic facts about moment theory 

Similar to cumulants, canonical moments provide a one-to-one transformation of the ordinary 
moments. They appear naturally in the continued fraction expansion of the Stieltjes transform of 
a probability measure but are less known than cumulants. Therefore, we state some basic facts in 
the following two paragraphs, where we distinguish between bounded and unbounded intervals. 

2.1 Canonical moments 

Canonical moments have been investigated in a series of papers by Skibinsky (1967, 1968, 1969) 
and roughly speaking define a one-to-one mapping from the set of moments Af([0,1]) (or more 
generally from At ([a, b\) for any finite interval [a, b] C M) onto the set [0,1] N . They have implicitly 
been discussed before in the work of Verblunsky (1935, 1936), who mainly considered measures on 
the unit circle. In this section we briefly present some basic facts for the sake of a self contained 
presentation and discuss corresponding results for the set A4([0,oo)) and A4(M). For details 
we refer to the monographs of Dette and Studden (1997) and Wall (1948). For a given vector 
mu -1 —(mi ,..., mk-i) T € Atfc-i([0,1]) of moments of a probability measure on the interval [0,1] 
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define 


\rn k (fi) 

/i G "P([0,1]) with 

/ t l dp(t) = rrtj for i — 1,. 

Jo 

■ ■,k- l| 

| m k (p) 

/i G "P([0,1]) with 

/ t l dp,{t) = rrii for i — 1,. 

Jo 

■ ■,k- lj 


Throughout this paper let Int C denote the interior of a set C. It is shown in Dette and Stridden 
(1997) that rrik = (mi, ... ,mk) T G IntA4fc([0, 1]) if and only if m k < m*, < m k . In this case the 
canonical moments of order l — 1,... ,k are defined as 


(2.1) Pl = Pl (m k ) mi + mi _ ; l = l,...,k. 

Note that for m* G Int A4fc([0,1]) we have pi G (0,1); l = 1 and that pk describes the 

relative position of the moment rrik in the set of all possible k -th moments with fixed moments 
m 1; ■ ■ ■ ,n^k- 1 - It can also be shown that the definition (2.1) defines a one-to one mapping from 
Int A4 n ([0,1]) onto the open cube (0, l) n . As an example consider the arcsine distribution p° on 
the interval [0,1] with density l/(7r \/x[l — x)), then the corresponding canonical moments are 
given by pi = 1/2 for all £ G N [see Dette and Studden (1997)]. Consequently, the sequence 
of moments of the arcsine distribution defines the center of the moment space A4([0,l]). Note 
however, that it is not the barycenter of the moment space. 

The determinant of the Hankel matrix H_ 2k = (m ? ; +J )C =0 of the moment vector (m 1; ..., m 2 k) can 
easily be expressed in terms of the corresponding canonical moments, that is 

k 

(2.2) det H 2k = det {m i+j ) k i j=0 = (piqip 2 ) k JJ (q 2j - 2 P 2 j-iq 2 j-iP 2 j) k ~ J+1 , 

3 =2 


where q 2 — 1 — pj [see Dette and Studden (1997), Theorem 1.4.10]. 

In the case / = [0, oo) the upper bound m k is in general not finite, but we can still define for a 
point rrik-i G Int A4fe_i([0, oo)) the lower bound 


m k = min < m fc (/i) 


/i G 'P([0, oo)) with / t l dp{t) = mi for i — 1,..., k — 1 


where m k = (m 1; ..., m k ) T G Int A4fc([0, oo)) if and only if rn k > m k . In this case, the analogues 
of the canonical moments are defined by the quantities 


(2.3) z, = m mi _ l = 1,..., k 

mi _1 - m l _ 1 

(with 7 Uq =0). As in the case of a bounded interval the definition (2.3) provides a one to one 
mapping from Int AI n ([0, oo)) onto (M + ) n , and it can be shown using similar arguments as in 
Dette and Studden (1997) that 

det H_ k = (mk — m k ) det H _ k _ 2 , k > 2. 
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Consequently, the determinant of the Hankel matrix is given by 

k 

(2.4) det H 2k = det(m i+j ) k j=0 = (z 2k _ 1 z 2k ) k ~ J+1 , 

3 = 1 

Finally, in the case / = M neither m jT nor m^ are in general finite. Nevertheless, there exists an 
analogue of the quantities Pi and z t defined in (2.2) and (2.4). To be precise, we define for a vector 
rrv 2 n _i = (mi ,..., m 2n _ i) £ -M 2n _i(M) with H_ 2n _ 2 > 0 the polynomial 


(2.5) P k (x) = 


m 0 ■■■ m fc _i 1 


m k 


m 2k ~i x* 


m 0 


m k -i 


[see Chihara (1978)]. We consider a one to one mapping 

( 2 . 6 ) 

defined by 


m k -1 


^ 2 fc -2 


Int A4 2n _i(M) —> (M x M+) n_1 x 
n^n-i '“t (&i, ai ,..., a n _i, b n ) T 


k = 1,..., n 


(2.7) / x k P k (x)dp(x) = ai...a k , k = l,...,n — l, 

Jr 

(2.8) / x k+1 P k (x)dp(x ) = ai...a k (bi-\ - \-b k+ i ); k = 0,..., n - 1, 


where p is any measure with first 2n — 1 moments given by (mi,..., m 2n _i) [see for example Wall 
(1948)]. Note that P\(x),..., P n (x) are orthogonal polynomials with leading coefficient 1 with 
respect to the measure p. It is now easy to see that the determinant of the Hankel matrix can be 
represented as 

k 

(2.9) det H_ 2k = det (m i+j ) k j=0 = a k ~ ]+l . 

3 = 1 

In the following section we will equip these moment spaces with distributions. We begin with the 
moment space corresponding to measures on bounded intervals. 


2.2 Distributions on moment spaces 

Chang et al. (1993) considered a uniformly distributed vector on the set _M„([0,1]) and showed 
that an appropriately standardized version of a projection U k (m n ) onto its first k components is 
asymptotically normal distributed, where the centering has to be performed with the moments 
of the arcsine distribution. A key ingredient in their proof is the following lemma, which shows 
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that the canonical moments of a uniformly distributed vector m n on Af n ([0,1]) are independent 
[for a proof see Dette and Studden (1997)]. For this and the following statements we will make 
the dependence of the canonical moments on the dimension of the moment space A4 n {[ 0,1]) more 
explicit. More precisely, we use the notation p n ,e(m n ) instead of pfim n ), and the symbol (3(a,b) 
denotes a Beta-distribution on the interval [0,1] with density 

/ [0 ,i](x)x a-1 (l - x) b ^/B(a, b). 

Lemma 2.1. For a uniformly distributed random vector m n on the nth moment space A4 n ([0, 1]) 
the canonical moments p n ,i(w*n),... ,p n>n (mn) defined by (2.1) are independent and Beta-distributed, 
that is 

Pnfirrin) ~ /3(n — i+l,n — i + l) . 

Note that the mapping between the (regular) moments and the canonical moments has only been 
defined on the the interior of M. n {[ 0,1]). However, Al n ([0,1]) is a closed, convex set and therefore 
its boundary has Lebesgue measure 0. Since we endow this space with the uniform distribution, 
the random variables p n ^ are a.s. well-defined. We also note that Dette and Nagel (2012) defined 
more general distributions on .M n ([0,1]), which contain the uniform distribution as a special case. 
In order to define an analogue of the uniform distribution on the unbounded moment space 
_M n ([0, oo)) these authors use the relation (2.3). To be precise, consider a random vector m n 
and denote the quantities in (2.3) by z n ^(m n ),..., z n)n (m n ). A density on the moment space 
_M n ([0, oo)) is then defined by 

n 

(2.1-0) g t ff ,s \m n ) = JJ^ nifc (m n ) 7 "' fc exp(-5 nifc ^ nifc (m n ))l Kfc(mri)> o}, 

k= 1 

where the constants satisfy 7 n ^ > — (n — k + 1), 5 n y. > 0 for k = 1,..., n, and the normalizing 
constant is given by Cn’°° ] = IIfc=i {^nk n ~ k+1 )I^^Ctn,k + n — k + 1)). The analogue of Lemma 2.1 
is now provided by the following result, where the symbol y(a, b ) denotes a Gamma distribution 
(a, b > 0) with density 

l [0 ,oo)(x) • 

T(a) 

Lemma 2.2. For a random vector rrin with density (2.10) on AL„([0, 00 )) the canonical moments 
Zn,k{ m n) defined by (2.3) are independent and Gamma-distributed, that is 

z n ,k(rnn) ~ 7 ( 7 ^ + n- k + 1 , 6 n , fe ), k = l,...,n. 

A proof of Lemma 2.2 can be found in Dette and Nagel (2012) and we conclude this section with 
the corresponding statements for the moment space A^n-i^)- Following Dette and Nagel (2012) 
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we define a distribution on A^ 2 n-i(®) by 


^2n-l( m 2n-l) = fj j^^exp (-5 n ,2fe-l^,fe(m2n-l)) 

k=1 * ^ 

n_1 2fe 7n - fc+2n_2fe 

(2.11) X 2 ?7 _ 2juj fl n,fc (^2n—l) ex P ( — ^n,2k a n,k(' m 2n-l)) l{a„ : fc ( m 2 n-l)> 0 } 5 

where the constants satisfy 7 ^*, > — 2(n — fc) for k = 1,..., n — 1 and 5 U) i,..., hi^n-i > 0. The 
distribution of the corresponding quantities a k and b k defined by (2.7) and (2.8) is specified in the 
following result. 

Lemma 2.3. Let m 2 n-i G A^n-^R) a random vector with density defined in (2.11). 

Then the random coefficients (b n> i, a n> i ..., a njn _i, b n ^ n ) T defined by (2.7) and (2.8) are independent 
and 


b n ,k ~ A/"(0, 2 ^- 7 ), an,fc ~ 7(7n,fc + 2 n 


2/c, S n ^2k) ■ 


Remark 2.4. There exists an interesting relation to random matrix theory in particular to the (d- 
ensembles considered by Dumitriu and Edelman (2002); Edelman and Sutton (2008); Ramirez et al 
(2011) among others. To be precise, consider exemplarily the moment space _A/f 2 n-i(R)- If can be 
shown that for a point m 2n _i G Int ,A 42 n-i(R) the polynomials defined in (2.5) satisfy the three 
term recurrence relation 

( 2 . 12 ) xP k (x) = P k +i(x) + b k+1 P k (x) + a k P k _fix); 

(. Pq{x ) = 1, P\(x) = x — bi), where the coefficients in the recursion are defined by (2.7) and (2.8). A 
straightforward calculation now shows that the polynomial P n (x) is the characteristic polynomial 
det(xl n — A n ) of the matrix 


(2.13) 


/ b\ y/dfi 


A n — 


\ 

\fr> 

y/(ln-2 b n — i ^/a n _i 

yja n ~\ b n ) 


If rrb 2 n-i is a random vector on A 42 n-i(R) with density dehned in ( 2 . 11 ), and 5 n ^k-i = 1/2 

(k — 1,..., n), d nt 2 k = 1, 7 n ,k — — 2 fin — k) {k — 1,..., n — 1) for some (3 > 0, then it follows 

from Lemma 2.3 that the coefficients in this matrix are independent with distributions b % ~ 
AA(0,1), a i ~ !xg( n -i)- This means that P n (x) is the characteristic polynomial of the random the 
matrix (2.13) corresponding to the (d- Hermite ensemble as introduced by Dumitriu and Edelman 
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(2002). While the common matrix literature investigates spectral properties of this matrix, the 
random Hankel determinant corresponds to a product of L 2 -norms of the (random) polynomials 
Pi,..., P n , that is 

n „ n 

t / Pt^Mdx) = n<- +i . 

i=1 i=l 

where /i denotes a random measure whose first 2n — 1 moments are defined the random Jacobi 
matrix (2.13). 

We also note that a similar interpretation is available for the random moment sequences on 
-^2n-i([0,1]) and _Ad2n-i([0, oo)) observing the results of Killip and Nenciu (2004) and Dumitriu and Edelman 
(2002), respectively. 


3 Weak convergence of Hankel determinant processes 

Throughout this section we investigate the asymptotic properties of the stochastic process 
(3-1) {D n: 2[nt\ ( m 2n)}te[0,l] = {l°g det H_ n p\nt\ { rn/ 2n) }te[0,l] 

where H_ n2 y nt Anv 2 n ) = (rrq+j(m/ 2n ))is the Hankel determinant of a random vector rri 2 n 
on the moment space M.2n(I)- We also investigate the asymptotic properties of the vector 
( D n 2 (rrv 2 n ),..., P ?l , 2 fc( m 2 n)) for some fixed k E N. In the following discussion we treat the cases 
of a bounded and unbounded moment space separately. 

In the following discussion the symbol Y n A- Y denotes weak convergence of a vector valued 
sequence of random variables (Y n ) n£ N . Moreover, let 

^°°([0,1]) = {/ : [0,1] —>■ M | \\f\\oo < oo} 

denote the space of bounded real-valued functions on the interval [0,1], with the topology induced 
by the uniform norm || • ||oo- We denote by X n ==^> X the weak convergence of a sequence (W n ) neN 
of random variables in Z°°([0,1]) to an Z°°([0, l])-valued random variable X. This is the mode 
of convergence described in Chapter 1.5 of van der Vaart and Wellner (1995). We also use the 
convention 0 log(0) := 0 and denote by s A t resp. s V t the minimum resp. maximum of s,t E R. 

3.1 Hankel determinants from Al2n([0,1]) 

Consider a uniformly distributed random vector m^n on At2n([0,1]), that is m 2n ~ U(Xi. 2 n )- We 
first investigate the weak convergence of the vector 

(3.2) H n k (Pn,2(^'2n)) ■ ■ ■ j l^n,2fc(^'2n)) 


for a fixed k G N. 


Theorem 3.1. If m^n r ^ld(M. 2 n), then the random vector 

V^{H n , k -H° tk ) 4^(0,E fc ) , 

where H® k = (L>° 2 ,..., D° n2k ) T and D Q n 2k denotes the log-determinant of the Hankel matrix cor¬ 
responding to the arcsine distribution, that is 

(3.3) D° nt2k = logdet ( V 2 ^')) = - k(2k + 1) log(2), 

'\ 1 + 3 J i,j= 0 ,—,k 

and the asymptotic covariance matrix is given by 

(3.4) E* = (i A j)*,,, . 

Proof: In all proofs of this paper we do not reflect the dependence of the canonical moments on the 
vector of random moments and use the notation p 2n ^ = P2n,i( m 2n)- According to Lemma 2.1 the 
canonical moments p 2n ^ are independent and f3(2n — i + 1, 2n — i + l) distributed (i — 1, 2,..., 2n). 
Since q 2n ,i = 1 - P2n, i rsJ /3(2n — i + 1, 2n — i + 1 ) it follows from Lemma A.l and the Delta method 
that 

(3.5) V4n(log(g 2 „,fc) - log(|)) A A/”(0,1) 

(3.6) V / 4n(log(p 2n ,fc(l - P2n,k)) ~ log(|)) A 0 

Next, note that the representation (3.3) follows from (2.2) and the fact that the canonical moments 
of the arcsine distribution are all given by 1/2. Consequently, we can decompose the vector H n ^ 
as follows 


V^(H n>k - H° njk ) = S n — T n , 

where the components of the vectors S n and T n are given by 

i 

Sni = ^ \An(z - j + 1) (log(g 2 n, 2 jP 2 ra, 2 j) + log(g 2 n )2 j-lP 2 n ) 2i-l) ~ 21og(±)) , 

3 = 1 

i 

T n i = ^^(log (g2n , 2j ) - log(i)) , 

3 = 1 

respectively (i — 1,..., k). Observing (3.6) we see that S n converges in probability to 0. The weak 
convergence T n A A/"(0, £&) is a routine exercise that follows from (3.5) and the independence of 
the q 2n ,i (i = 1,..., 2n). 

□ 


While Theorem 3.1 holds for any hxed k G N, the following result provides a process version. 
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Theorem 3.2. Let m^n denote a uniformly distributed random vector on 1]), then 


{Qn (t) } ig[ 01 ] := -^^Dn,2lnt\{'rn 2 n) ~ Dn,2[nt\ + = ’ > ^ (^)}*e[0, 1 ] J 

where D n ^\_ n t\ ( rn 2 n) Is defined in (3.1), 

(3.7) r(t) =t + (1 -t)log(l -t) 

and fyA 1 ] is a centered continuous Gaussian process on the interval [0,1] with covariance kernel 

sAt 

(3.8) f(s,t) = f (t 7^p a) dx = (a A t)(2 - s V t) - (s + t - 2) log(l - s At) 

o 


Proof: It is shown later (more precisely, in the proof of (3.10)) that the kernel / is in fact 
nonnegative definite, that is for all k G N, s±,..., Sk,t\,... ,tk G [0,1] the matrices (/(s*, h/))(j =1 
are nonnegative definite. A simple calculation shows that E[(£7t 0,1 l(2) — ^’^(s)) 4 ] < 48(2 — s) 2 , 
and consequently the existence of the process = {^^ 0,1 ^)}<e[o.i] follows from Theorem 3.23 in 
Kallenberg (2002). Moreover, since f/A'l j s continuous and /°°([0,1]) is a complete space, Theorem 
1.3.2 in van der Vaart and Wellner (1995) shows that is tight. For the following discussion 
we define 


£n,i — log(g2n,2*P2n,2i) + log(?2n,2j-lP2n,2i-l )i 

€n,i(t) = -7=(L nt J “ i + ~ E[ln,*]), 

a Jn 

and obtain by (2.2) the decomposition 

(3.9) Quit) = S n (t ) + 2R n (t) — 2 T n (t) + 2 U n (t), 

where the processes S n , R n , T n and U n are dehned by 


S n {t) 

T n (t ) 
R n (t) 
U n (t) 


[nt J — 1 

i =1 


[nt] — 1 


[nt\ log(2) — log(p 2 n,2Lntj) + log(g 2 n,2i) }, 


i= 1 


I{nt > 1} 


n 
[nt J — 1 


l°g(?2n,2|ntj - lP2n,2[nt] -1) , 


~7^{ (L nt J “ i + - <2 L n*j + + iMl lo 

^ 2—1 
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respectively. With these notations the proof of Theorem 3.2 follows from the assertions 


(3.10) 

s n = 

=* e 10 ' 11 

(3.11) 

T n = 

=^o, 

(3.12) 

-Rn = 

=^0, 

(3.13) 

r„i 

1 n—>-oo 

| oo 


and a simple application of Slutsky’s theorem. 

Proof of (3.10). For each fceN consider 0 = to < t\ <...< tk < 1 and define the fc-dimensional 
random variable S* := (S n (ti),..., S n (tk)) T . Let c = (ci,..., Ck) T € be an arbitrary vector, 
then 

k k |n*d ~ 1 fe % [nhj - 1 

C S n = y ] CjS n (tj) = y ' Cj fn,j(ti) = Ci fn,j(ti) 

i =1 2=1 i=l i=l Z=1 j=|_ntj_iJVl 

2 & |n*iJ ^ & 

= — 7n ^ ^ (£n>j — El Cj(L n ^*J — J + 1) 

Z=1 j=Lntj_!jVl i=z 

In order to calculate the variance of c T Sf we assume 0 < s < t < 1, use the approximation (A.2) 
in the Appendix and obtain 


. RsJ- 1 

co v(S n (s), S n (t)) =- V (|ratj - i + l)(|ra«J - * + 1) Var(| n>i ) 
n 

i=l 
[ns]—! 


-- E 

i =1 


([ntj - i + l)([nsj - i + 1) 


4(n — i + l) 2 


[ns J — 1 


+ - ^ QntJ - i + l)(|nsj - i + 1)0 ((n - i + 1) 3 ) 


i =1 


[nsj-l 0 _ »-l+rit-LntJ \ / 

1 V__ n J V 


i—1+ns— |nsj 


[nsj — 1 


(i 


»~1N2 


2=1 V n ' 1=1 
Interpreting the first term as Riemann-sum, we can calculate the limit 


+ - E Oftn-i + l)- 1 ; 


lim cov(S'„(s), S n (t)) = 


(1 — x) z 


which gives 


lim Var(c T S'*) = lim c T cov(S*, S*)c —* c T Sc , 

n—>■ oo n—>-oo 


li 








where the matrix £ is given by £ = (/(£*, £j))ij=i,...,fc and the covariance kernel / is defined in 
(3.8). Consequently we obtain that this kernel is nonnegative definite. 

We now prove the weak convergence of c T S* by verifying the Lyapunov-condition. For this purpose 
we use the notation c* := max{|ci|,..., |c n |} and obtain 


n“ 


->4 k \nti\—l k 

£ £ E[K» J --E[i,j]) 4 ](£ci(KJ-i + i) 

1=1 j=|n^-ijvi i=l 

q 4 k [ntii-l 

<^£ £ E [k»,j- E fe,i] 4 ](^*) 4 (»-i + i) 4 < 


(2 kc*) A C 2 


1=1 j= VI 


n 


y 0 ) 


where we have used the estimate (A.5) in Appendix A for the moments. Consequently, Lyapunov's 
Theorem implies convergence of the finite dimensional distributions, that is 

S* n = (S' n (f 1 ),...,,S n (4)) T 4Ar(0,£). 


We finally prove that S n is asymptotically tight, that is 

(3.14) lim limsupP (cu n (A) > e) =0, 

m—>oo ri—>oo 

where co n (a) = sup{|S' n (t) — S n (,s) \ 0 < t — s < a} denotes the modulus of continuity of the 
process S n . The statement (3.10) then follows from Theorem 1.5.4 in van der Vaart and Wellner 
(1995). For a proof of (3.14) we introduce the notation 

i < |_nsj — 1 

|_nsj — 1 < i < [nt\ — 1 , 
else 

and obtain the following representation 

n 

Sn{t) - S n {s ) = d nii (| n ,i - E[|n,i])- 

i= 1 


dn,i 


n 


[nt\ — L^sJ 
\nt J — i + 1 
0 
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The inequalities (A.4) and (A.5) in the Appendix then yield 


E [(S„(() - S„(s)) 4 ] <C 2 (V< 


i=1 


' (n — i + l) 2 


[nsj — 1 


'-<*c?( 1 r ' {t - s+ " )2 i 1 f hxST 2 

- 1 H- i + I)2 + „ 2-. (1_ 4=1)2 

i=l v n n> i= ns IVI v ™ ' 


dx + 


<(2C) 2 (j {.>g}(«-« +a 2 - 1 

V 1 l 1 - * + n) 

n 

< (2C) 2 (/{s > l) (t -s + ±) 2 (\_4vn - 1) + (t - s + i) ) 


(|_utj — [nsj )\ 2 


(3.15) < (4C) 2 (« - s + l) 2 . 

Consequently, we obtain 


m m (' A /'"'’') 2 

(3.16) lim 3 up^B[(S„(i)-S„(tl)) 4 ] <lim 3 up^(4C) 2 (l; + i) = K —f. 

n—>oo fc—i n—>oo ^ ^ ’iL 

Now assume that 0<r<s<£<l. If t — r > i, Holders’s inequality and (3.15) yield the 
estimate 


E [(S„(s) - S n (r)) 2 (S n (t) - S„(s)) 2 ] < (4C) 2 (a - r + 1) (i - s + i) 

<(4C) 2 (l^ + l) 2 <(6C) 2 («-r) 2 , 

which also holds if t — r < j (because we have S n (r) = S n (s) or S n (s) = S n (t) in this case). 
Therefore Lemma 3.1 in Shorack and Wellner (1986) and (3.16) show that 

limsupP (u n (A) >e) < limsup -U X^E[(S n (£) - S n (^I)) 4 ] + A ^ } 

n—>oo n—too tv nt j 

k= 1 

(4 Cf + K(6C) 2 
e A m 

for an absolute constant K. This proves (3.14) and completes the proof of (3.10). 

Proof of (3.11) and (3.12): These statements follow by similar arguments as given in the proof of 
assertion (3.10) using the estimates (A.l) - (A.7) in Appendix A. The details are omitted for the 
sake of brevity. 

Proof of (3.13): By (3.3) we have 

D n,i\nt\ = -( 2 N + !)M lo g(2) 1 
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and the estimate (A.l) from Appendix A yields the approximation 


[nt J — 1 \nt\ — 1 

(M - i + l)E[| n ,i] = (|nt| - i + 1)( - 41og(2) - 2(n _\ +1) + 0( (2n _ l 2i+l)2 )) 

i= 1 2=1 

= —2 log(2)(|_nfj 2 + 2[ntJ) + log(16) - - G n - [nt j+i) 

+ 0(log(n)) 


(uniformly with respect to t G [0,1]), where G n = y ^”_ 1 j is the nth partial sum of the harmonic 
series. Therefore 

u n (t) = 2 =(==y + 1(1 _ t) iog(i - 1 ) + - G„_ lntJ+1 )) + 


Using the approximation G n = log(n) + 7 + O(^), where 7 is the Euler-Mascheroni constant, we 
can easily see that 


Unit) = - t) log(l - t) - log (1 - t + !«!)} + o(!^M) 


-n(l — t) 


log (1 + 4gHr) + 


o 


log(n) 


= o(l) , 


2 y/n ~ v " J / v y/n 

uniformly with respect t € [0,1], which completes the proof of Theorem 3.2. 


□ 


Remark 3.3. Similar results as stated in Theorem 3.1 and 3.2 can be obtained for the Hankel 
matrices H 2n+1 = (m i+j+ i)^- =0 , H 2n = ( m i+j+1 - m i+j+2 )™J= 0 and H 2n +1 = (m+j ~ m i+j+1 )l j=0 , 
which are commonly used to characterize Hausdorff moment sequences [see Karlin and Stridden 
(1966)]. The details are omitted for the sake of brevity. 


3.2 Hankel determinants from Al2n([0, 00 )) and .M2n(®0 

In this section we will derive analogues of Theorem 3.2 for random moment sequences on un¬ 
bounded moment spaces, where the corresponding distributions are defined by ( 2 . 10 ) and ( 2 . 11 ), 
respectively. For the sake of brevity we omit the discussion of D nj2 k(m 2n ) for fixed k (correspond¬ 
ing results can be easily obtained using similar arguments as given in the proof of Theorem 3.1) 
and concentrate on the stochastic process {D n> 2 [ n t\ (m 2 n)}te[o,i]- 

Theorem 3.4. Let mo. n denote a random vector on At 2 n([ 0 ,oo)) with density gn defined in 
( 2 . 10 ), where 72 n,i 5 • • • 72 n, 2 n > 0 are bounded by a constant which does not depend on n and 
d 2n ,i = 2n-i+l + 72 n ,i, then 

{Gn(t )} t6 [ 0)1 ] := |--Dn,2L^j( m 2«)} tg[0 == ^ > {^' '^)}te[0,l] 
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where D n ^\nt\ (w^n) is defined in (3.1) ; and is a continuous Gaussian process on the interval 

[0,1] with mean —r(t)/2 and covariance kernel 

sAt 

(3.i7) g(»- t )=/ (t ~r_7 a) ‘ fe 

0 

= -(s A t)(s + t — 2 + s V t) + (s — l)(t — 1) log(l — s At) 

Proof: We will use the decomposition -D n ^\nt\ {min) — Sn'°°\t) + Rn’°°\t ), where the processes 
and Rn ’°°^ are dehned by 

S^°°\t) = ^ (D n> 2Lntj(»W2n) ~ E[-D n> 2LntJ (»«2n)]) 

i4° ,OO) (0 = ^[Dn^nt^n)] 

Observing the fact that bX ~ 7(a, 1), whenever X ~ 7(0, 6), and using the approximations (A.8) - 
(A. 10) from the Appendix it can be shown by similar arguments as given in the proof of Theorem 
3.2 that converges weakly to a centered continuous Gaussian process on the interval [0,1] 

with covariance kernel (3.17). For the remaining term Rn'°°\t) we use (2.4), Lemma 2.2 and the 
approximation 


E(log(2 2 n,i)) = E [log (z 2 n,i ■ (2n - i + 1 + 7 2 n,i))] - log(2n - i + 1 + 7 2n ,i) 

1 


2(2 n — i + 1 + 72 n ,i) 

This yields (uniformly with respect to £ € [0,1]) 


+ 0((2n — i + 1 + 72n,i) )■ 


LntJ 

i =1 


^ l---j / ^ [ntj —nt+1 £ i_ [ntj —ra£+l 


+ 


47T ^ \ 1 2. 72n,21+1/2 J — + 


72n,21-1 + 1 


o 


log(n) 


n 


and a careful calculation shows that this term converges uniformly to —r(t)/ 2, where r(t) is dehned 
in (3.7). This yields the assertion. □ 


We conclude this section with a corresponding result for the moment space A4 2n (M). The proof 
is similar to that of Theorem 3.2 and therefore omitted. 

Theorem 3.5. Let rri 2 n-i denote a random vector on A4 2n -i(E) with density hn defined in 
(2.10), where 7 n ,i, ■ ■ ■ 7n,n are bounded by a constant which does not depend on n and S n ^ t = 
2n — 2i + 7 n> i, then 

{-Ai,2L(n-l)tj(m2n-l)} te[oi] =► {£*(*) }ie[ 0 ,l] 

where D n ^\{n-i)t\ (‘ m 2 n) is defined in (3.1), and is a continuous Gaussian process on the interval 
[0,1] with mean —r(£)/4 and covariance kernel g(s,t)/2, defined by (3.7) and (3.8), respectively. 
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4 Large deviations 


Throughout this section we consider large deviation principles (LDP) for the moment space 
A^2n([0,1]). Similar results can be obtained for moment spaces corresponding to unbounded 
intervals. For fixed k the sequence defined in (3.2) for a uniformly distributed vector 

rrv 2 n on the moment space A^nQO, 1]) satisfies an LDP with a good rate function. To see this, 
observe that the sequence of canonical moments (V n ) neN = ((p 2 n,i, • • • ,P 2 n, 2 k) T )n.m satisfies a large 
deviation principle with good rate function 

2k 

i( x ) = 2 log ( Xi ” x i) ~ lo s( 4 )) 

i—1 

(c.f. Gamboa and Lozada-Chang (2004)). As the function that maps the canonical moments 
to the logarithms of the Hankel-determinants is obviously continuous, the contraction principle 
[Theorem 4.2.1 in Dernbo and Zeitouni (1998))] shows that (H^neN satisfies an LDP with a good 
rate function. However, due to the complicated form of this map it is not possible to explicitly 
represent the corresponding rate function in terms of standard functions. 

The investigation of LDP-properties of the logarithm of the lower Hankel determinant with in¬ 
creasing dimension turns out to be substantially more complicated, and we consider again the 
process {D nt2 \nt\ ( m 2n)}te[o,i] ; which has to be normalized differently, that is 

Z n {t) — ~~{D n 2[nt\('IT r ^2n) ~ ^n,2\nt\) i 

where q Uy o = 1. Let «S( [0,1]) denote the space of all signed regular Borel measures on the interval 
[0,1] endowed with the weak-*-topology (with (C([0,1]), || ■ ||oo) as predual). Then its (topological) 
dual space is the space C([0,1]) of all continuous functions on the interval [0,1]. In the following 
we interpret the process Z n as the distribution function of a random measure v n G <S([0,1]). To be 
precise note that the process Z n is piecewise constant with jumps at the points 4 ,..., n. Therefore 
v n is a linear combination of Dirac-measures and a simple calculation shows that 

^ n i i—1 

(4.1) ■'» = --£{£ log(4g 2 n. 2 j — lP 2 n. 2 j — 1 ) + log(4g 2 n,,2jP2n,2j) + log(2p 2 n,2i) , 

i= 1 j= 1 j= 1 

where 5 t denotes the Dirac measure at the point t G [0,1]. In order to investigate the large 
deviation properties of the sequence of random measures {z/ n } ne N we first derive the limit of the 
(normalized) logarithmic moment generating function. 


Theorem 4.1. Let u n denote the random measure defined in (4-1)- For any Riemann-integrahle 
function f G Z°°([0,1]) we have 


A(/) = lim -logE [ e nu Ff)} 

n^-oo 77 , 


— / lo g 

0 


IS))* A ' < 2 

K > 2 
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where 


i 

G(x) = I fit) dt ; 

X 


K 


sup 

ze[o,i) 


G(x) 

1 — x 


It is in general unknown what happens in the case K = 2. 


Proof: Interpreting the sequences 

n n 

X n,j = ~'52f(i) and Vn.J = ~'52 f(i) 
i=j i=j+ 1 

as Riemann-sums, we get the approximations 

(4.2) sup |G(i) + ^i|^0, sup |G(A)+^|^>0. 

j=l,...,n j=l,...,n 

This yields for the logarithm of the moment generating function 

^log (E[exp(ni/„(/))]) 

1 n 

= ~ 52 ( l0g ( E [V2nSj-lP2nij-l\) + lo S ( E ) + ( 3x »j + Vnj) log(2)) . 

3 =1 

For the determination of the limit we now consider the two cases K > 2 and K < 2 separately. 

(1) In the case K > 2 we choose constants S, C > 0 such that for all sufficiently large n there 

exists a j n 6 {1,... ,n} with 1 — ^ > C and G(&) > (2 + d)(l — ijf) (this is possible since the 
function G is continuous). Choosing another constant 0 < e < 5C and considering (4.2), we get 
the following approximation for sufficiently large n: 

ElMn < ]*njn +G (i)|_ G (fc) < e _ (2 + d)(l - &) . 

Therefore 2 n — 2 j n + 1 + x n j n < 1 + n(e — 5C) < —1, which yields E[5^2)"_ 1 Ihfffi -1 ] — 00 and the 
assertion follows. 

(2) I11 the case K < 2 we use the formula 


l°g(F(x)) = {x-\) log(x) - x + lQg ^ + 2f 0 (x) , 

where 


(4.3) 


<P oW 


OO 


0 


arctan (-)) 
exp(27rt) — 1 


dt 
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[cf. (4.3) in Dette and Gamboa (2007)]. Using the representation (4.1) we can show that 


(4.4) —E[exp (nu n (f))] = B nA + B n>2 + B nA + B nA 


n 


T R(2n 2 j T 2 : x nAl x n j) -\- R(2n 2j T \^x nA ^y n j') ? 


where 


b ».‘ = -^E 1 o U 1 + 

3 = 1 

1 U 

5 "- 2 = -buE lo s ( 1 + 


2n—2j+2 I ’ 


3 = 1 


2n—2j+l I 1 


B Ui3 = - ^(2n - 2j + 1 + x nj ) log fl + 


3=1 

n 




2(2n—2j+l)+a: n j +y n ,j J ’ 

B M = ^ E( 2n - 2 7 + 1 + y n j - |) log (l + 2(2n-2 J -%+2 tt , J - +y „, 3 - ) ’ 


3=1 


and the remaining two terms are defined by 

R(a, x, y) = 2(0o(a + x) - 0 o (a) + 0 o (a + j/) - 0o(a)) - 4(0 o (2a + x + y) - 0 O (2a)) . 

We now investigate the terms in this decomposition separately. The first term B nA can be inter¬ 
preted as Riemann-sum, using (4.2), that is 


1 " / G(i)+o(l) 

(4.5) 

3=1 


2( 1 -¥)+°( 1 ) 


A-- / log 1- 


G(x) 


2(1 — x) 


dx . 


Analogously, the second term converges to the same limit, i.e. 


(4,6) 


B 


n. 2 


+ -- / log 1- 


G{x) 


2(1 -x) 


dx 


For the the terms B n 3 and B nA we use the Taylor-approximation log(l + x) = x + 0(x 2 ) (x —>■ 0) 
and obtain 


(4.7) 

(4.8) 


3 — 


i Ahi-h-Gfa+oa) 


—T 

2n ^ 


B 


n, 4 


S 2 (l-j)-G(i)+“(!)' 

G(0) 


/(i) + 0 U 


n—»oo^ G(0) 


2 ’ 
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and it remains to show that the last two terms in (4.4) are asymptotically negligible. 

For this purpose we note that the following inequality holds for the function 0 O defined in (4.3) 
[cf. formula (4.10) in Dette and Gamboa (2007)] 


|0o {a + x) - 0o(a) | < C- 


\x\ 


a A (a + x)) 2 


with C = 


exp (27 rt) — 1 


dt , 


where a > 0,x > —a. This gives 


\R(a,x,y)\ < 2 C 


x 


+ 


\y\ 


(a A (a + x)) 2 (a A (a + y)) 


+ 4 C 


\x + y | 


(2a A (2a + x + y))‘ 


and using this inequality to estimate the terms R{2n — 2j + 2, x n j, x nj ) and R(2n — 2j + l, x n j, y n j) 
in (4.4) yields six terms, which have a similar form. For the sake of brevity we will only show 
exemplarily the convergence 


Dn-.= ~Y. 

n ' 


x. 


nj I 


“ (( 2n _ 2 i + 2) A (2n - 2j + 2 + x n j)) : 


+ 0 . 


The other five sums can be approximated in a similar way and the details are omitted. For 
sufficiently small e > 0 and sufficiently large n, we obtain by similar arguments as in the case 
K > 2: 


x n j > -(e + K)(n - j) - ej , 

2 n — 2 j + x nj > (2 — K — 2e)(n — j) . 

Choosing <5 = min{2 — K — 2e, 2} > 0, we get the inequalities 

(2 n - 2j + 2) A (2n - 2j + 2 + x nJ ) > 8(n - j + 1) , 
\x n ,j\ <n(\G(±)\ + e) < ||/|| 00 (n-j + l) + ne . 


This yields 

(4.9) lim sup D n < limsup< 

n .—Vnn n .—Vnn h 


E 


S 2 n *—' n 
j = 1 


+ 0E 


j + 1 “ (n - j + 1 ) 5 


oo ^ 

e \ ^ 1 


t \ ^ ± 

+2 2 -^ 12 • 
7 = 1 J 


Considering the limit e \ 0 on the right hand side of (4.9) we obtain that the last two terms in 
(4.4) converge to 0, and the assertion follows from (4.5) - (4.8). 

□ 


Remark 4.2. Note that for the application in Lemma 4.3 and Proposition 4.4, it would be 
suffivient to prove the preceeding theorem only for functions / G C([0,1]). However, we have 
chosen to prove it more generally for any Riemann-integrable function /, as this alios us to apply 
the formula for the limit in the proof of Theorem 4.5. 
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Lemma 4.3. The sequence (z/ n ) neN of random measures defined by (4-1) is exponentially tight. 
Proof: By the Banach-Alaoglu theorem the set 


K n 


|/i g «5([0,1]) 


sup /i(/) < a 
/ec([ o,i]) 
ll/ll<i 


is compact (note that we endowed <S([0,1]) with the weak-*-topology). We define the modified 
measure 

^ n i i— 1 

*4 =-■ { 5Z 1 °g( 4 92n,2j-lP2r l ,2j-l) + ^ lo g( 4 ?2n,2jP2r l ,2j) + log(p 2 n,2i) • 

i=l i=l j=l 


n 

Observing u n = u' n — l -^2l ^S± we can see v n (f) < v' n (f) + log(2) for all / G C([0,1]) with 

n i n 

Z=1 

11/||oo < 1. Since 1 / is a positive measure, we get by Markov’s inequality 
- logPK e if®) < - logP( sup i/ n (f) > a- log(2)) < - logE[exp(rw/(1))] - a + log(2) 

n n V /GC([0,1]) ' n 

ll/ll<i 

= -logE[exp(ra(i/ n (l) + log(2))] - a + log(2) A(l) -a + 21og(2) , 
n 

which yields the assertion. 

□ 


Proposition 4.4. Let A* be the Fenchel-Legendre transform of A and let E denote the set of all 
exposed points of A* which have an exposing hyperplane A that satisfies 

lim — logE[exp(z/ n (nA)] exists and A(yA) < oo for some 7 > 1. 

n—too 77, 

Then 

— inf A*(x) < lim inf — logP(z/ n G T) < lim sup — logP(i/ n G T) < — inf A*(x) 
xeEnr° ra->oo n n^oo n xeT 

for all measurable sets T C «S([0,1]). 

Proof: This follows directly from Baldi’s theorem [c.f. Theorem 4.5.20 in Dembo and Zeitouni 
(1998)]. 

□ 


The main difficulty in proving an LDP for the process {Z n (t)}te[ 0 , 1 ] consists in the fact that 
an explicit representation of the Fenchel-Legendre transform A* is not available. This makes it 
difficult to eliminate the set E in the lower bound in Theorem 4.4. On the other hand - in contrast 
to the LDP for the process {Z n (t)} t g[o.i] - the LDP for the random variable Z n (t) with a fixed t 
can be established. 
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Theorem 4.5. For a fixed t E (0,1] the sequence (Z n (t)) neN satisfies a large deviation principle 
with good rate function 

A*(x) = sup |Ax + log (l - dy} 

Proof: We will again apply Baldi’s theorem. To calculate the normalized cumulant generating 
function of Z n (t), note that 


A Z n [f) = u n (X /{• < t}) , 


and Theorem 4.1 yields 


A*(A) 


lim — log E[cxp(nAZ n (£))] 

n—>oo n 


\ — f* log (l 
A(A/{- < t}) = < Jo V 
I oo 


2(1 -y)) 


dy 


A < 
A > 


2 

t 

2 

t 


It now follows by similar arguments as given in the proof of Lemma 4.3 that the sequence (Z n (t)) ng ^ 
is exponentially tight (note that we can use the euclidean topology on K because the interval [0, a] 
is compact) and Baldi’s theorem yields an analogue of the inequality in Theorem 4.4, where the 
set E has to be replaced by an analogue set E t . It remains to prove that the lower bound remains 
correct if one removes the set E t . 

In order to see this, we define the new function 


{ M —> (—oo, oo] 

( MX) if A ^ f 

^j|im A t(A-e) if A = f 

At and A t have the same Fenchel-Legendre transform and it is therefore sufficient to prove 
(4.10) inf A*(x) = inf A* t (x) 

x£E t nF x£F 

for all open sets F Cl. It is easy to see that A t is an essentially smooth function and the iden¬ 
tity (4.10) follows by an adaptation of the arguments in the proof of the Gartner-Ellis theorem 
[Theorem 2.3.6 in Dembo and Zeitouni (1998)]. By Lemma 1.2.18 in the same reference the rate 
function A* is a good rate function, which yields the assertion. 

□ 
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Remark 4.6. It is possible also to prove the preceeding theorem more directly by an application 
of the Gartner-Ellis theorem, as the limit of A logE[e nAZ "^] can be calculated using Stirling’s 
approximation. However, these calculations would essentially be a repetition of the calculations 
done in the proof of Theorem 4.1, with /(•) replaced by XI{■ < t}. The present proof is shorter 
and given here for the sake of brevity. 

Our final result specializes Theorem 4.5 to the case t — 1, where the rate function can be de¬ 
termined explicitly. The proof follows by a straightforward calculation of Ai(A) and its convex 
conjugate. 

Corollary 4.7. The sequence (Z n (l)) ne N satisfies an LDP with good rate function 

2x — 1 — log(2x) x > 0 


I(x) = 


oo 


else 


A Auxiliary results 

In the proof of the results we make frequent use of the following approximations, which can can 
be derived from the approximations given in Dette and Gamboa (2007). Throughout this section 
C denotes a positive constant. 


(A.l) 
(A.2) 
(A.3) 
(A.4) 

(A.5) 


+ 4l0g(2) + 


Var (£ nti ) 


2 (n — i + 1) 
1 


< 

< 


C 


(2 n -2 i + l) 2 
C 


4 (n — i + l) 2 (2 n — 2i + l) 3 

|E[log(g„,i)] +log(2)| < 


n — i + 1 
C 

{n — i + l) 2 
C 2 

(n — i + l) 4 

Also, a direct estimate of the occurring integrals show for i < 2n: 

(A.6) 


Var (£ n ,i) < 

e[(^-e[^]) 4 ] 


< 


E [| log(g 2 n,*)| fc ] < 3 sup | log(x)| fc x < oo 
xe[o,i] 

Lastly, one can prove by differentiation under the integral that for a random variable A" P(a,b) 
(A.7) Var(log(A)) = ip^a) - ipi(a + h) 

where ipi(x) = fpj log(T(a:)) = x~ l + 0(x ~ 2 ) (x —> oo) denotes the trigamma function. 
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We also need to approximate the moments of gamma-distributed random variables. Using the 
notation d, : = we can see that 

Jrio g (r(* 0 ) = dj = iog(fe) - -t + o(v 2 ) 

^bg(r (k)) = d 2 -dl = i + 5 L + o(t- 3 ) 


d 4 

dk 4 


log(r(fc)) = d 4 - 3d 4 - 6dt - 4d 3 di + 12^d 2 = 0(/W 3 ) 


where the first part of the equations follows from formally differentiating the term, while the second 
part follows from the approximations of the polygamma functions in Abramowitz and Stegun 
(1964). If X ~ 7 (/c, 1), then for Y = log(X) the following equations hold 


(A. 8 ) 

(A.9) 

(A. 10 ) 


E[y] = d x = log(fc) - — + 0(k~ 2 ) 

Var(Y') = d 2 -dl = j + ^ + 0(k~ 3 ) 

E[(y - E[y]) 4 ] = d 4 - 4d!d 3 + 6 d\d 2 - 3d 4 = 3(d 2 - d\) 2 
3 


Lemma A.l. Let X n 


k 2 

(3(n,n). Then 


0(k 


— 3 \ 


0(k 


— 3 \ 


—>► A/"(0, |) 


Proof: The density of i/n (X n — is given by 




(A.ll) 


= B{n,n) — ) I 


n 4 


n 


By Stirling’s approximation 




7T 


T(2n) 2 2n_1 A /n 


holds, which implies that the density (A.ll) converges pointwise to the density of the A7(0, |)- 
distribution. An application of Scheffe’s theorem (c.f. Scheffe (1947)) yields the desired result. 


□ 
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